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1 

2 BACKGROUND OF THE INVENTION 

3 

4 

5 1. Field of the Invention 

6 

7 [0001] The present invention relates in general to computer programs, and more particularly to 

8 translating a path expression of an object oriented query into relational joins. 
9 

10 

1 1 2. Description of the Related Art 

12 

1 3 [0002] Enterprise JavaBeans (EJB) is a specification of an architecture for developing and 

14 deploying component-based distributed applications. Distributed applications developed using the 

15 Enterprise JavaBeans architecture are scalable, transactional, and multi-user secure. Such a 

16 distributed application may be written once, and then deployed on various server platforms 

17 supporting the Enterprise JavaBeans specification. An EJB architecture comprises an EJB server, 

1 8 EJB containers that execute on the EJB server, and EJB 's that execute in these EJB containers. EJB 

19 servers and EJB containers are developed and deployed by various middleware providers, and an 

20 application developer may use the EJB components to develop and deploy a component-based 

21 distributed application. 
22 

23 [0003] Enterprise Java Beans 2.0 (EJB 2.0) defines an EJB Query Language (EJBQL) which 

24 provides a solution for defining portable finder methods for container managed persistence (CMP) 

25 entity beans. Prior to EJB QL, the specification of CMP finder methods for a bean was vendor 

26 specific. If such a vendor specific bean is deployed into a different database having a different 

27 database schema mapping, then the vendor specific finder helper methods must be redefined. EJB 

28 Query Language makes the specification of finder methods easier and more portable. 
29 

30 [0004] The EJB Query Language provides a construct known as a path expression which specifies 

3 1 a navigation route from a previously referenced schema to a new schema. A path expression may 

SVL9-2002-0078-US1 3 



appear in an EJB QL query in either a SELECT clause, a WHERE clause, or a FROM clause, and 
thus path expressions affect both the scope and the results of an EJB QL query). However, although 
an EJB QL query may be converted into a SQL query to execute against a database, the path 
expressions are extensions not found in SQL. 

[0005] Path expressions may be translated to SQL joins operations when EJB ' s are mapped to 
relational tables. Path expressions in the SELECT clause are translated to a left outer join, path 
expressions in the FROM clause are translated to an inner join, and path expressions in the WHERE 
clause can be translated to either an inner join or an outer join depending on usage context. The 
translated SQL then consists a mixture of inner and left outer joins operations. Although a series of 
inner joins can appear in any order, the order of a mixture of inner joins and outer joins is critical for 
both the validity and performance of the query. 

[0006] Thus, there is a clearly felt need for an improved translation of a path expression of an 
object oriented query into relational joins. 
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1 SUMMARY OF THE INVENTION 

2 

3 [0007] Preferred embodiments of the present invention comprise a method, system, article of 

4 manufacture, and computer program product for translating a path expression of an object oriented 

5 query into relational joins. 
6 

7 [0008] In accordance with a preferred embodiment of the present invention, a path expression, 

8 comprising a navigation path through a relationship in a schema, in an object oriented query is 

9 translated to a relational database outer join by analyzing each path expression defined in each level 

10 of the object oriented query, and by identifying each path expression which can be a candidate for a 

1 1 translation to an outer join. The path expression are ordered starting with quantifiers defined in a 

12 FROM clause, to which are added each path expression identified as a candidate for a translation to 

13 an outer join. These ordered path expressions are input to a select operator for each level of the 

14 object oriented query, and the ordered path expressions are grouped sequentially based upon on a 

15 source-target dependency between ordered path expressions and based upon the identifications as a 

16 candidate for a translation to an outer join. A quantifier is created for each path expression, said 

17 quantifier comprising a variable representing a table in a relational database, and each grouped path 

1 8 expression is replaced with a related table in a relational database, and a translation of the object 

19 oriented query to a relational query is completed. 
20 

21 [0009] In accordance with an aspect of a preferred embodiment of the present invention, an 

22 optimization may also be performed on the grouped quantifiers to improve performance in which the 

23 optimization identifies quantifiers which can be a candidate for a translation to an inner join. 
24 

25 [0010] In accordance with another aspect of a preferred embodiment of the present invention, 

26 after the optimization, an inner join is generated for each quantifier which remains a candidate for a 

27 translation to an inner join, and an outer join is generated for each quantifier which remains a 

28 candidate for a translation to an outer join. 
29 

30 [0011] In accordance with another aspect of a preferred embodiment of the present invention, the 

3 1 optimization identifies a quantifier as a candidate for a translation to an inner join if a corresponding 
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path expression is defined with a NOT NULL foreign key. 

[0012] In accordance with another aspect of a preferred embodiment of the present invention, the 
optimization identifies a quantifier as a candidate for a translation to an inner join if a corresponding 
path expression is used in a FROM clause. 

[0013] In accordance with another aspect of a preferred embodiment of the present invention, the 
optimization identifies a quantifier as a candidate for a translation to an inner join if a LIKE, IN, or 
BETWEEN operator exists in a WHERE clause containing a corresponding path expression. 

[0014] In accordance with another aspect of a preferred embodiment of the present invention, the 
optimization identifies a quantifier as a candidate for a translation to an inner join if an EQUAL, 
LESS THAN, GREATER THAN, LESS THAN OR EQUAL, GREATER THAN OR EQUAL, NOT 
EQUAL, or NOT NULL operator exits in a WHERE clause. 

[0015] A preferred embodiment of the present invention has the advantage of providing improved 
translation of an object oriented query into a relational query. 

[0016] A preferred embodiment of the present invention has the advantage of providing improved 
translation of an object oriented query comprising a path expression into a relational query. 

[0017] A preferred embodiment of the present invention has the advantage of providing improved 
performance of a relational query translation of a path expression. 

[0018] A preferred embodiment of the present invention has the advantage of providing improved 
access and manipulation of database null values. 

[0019] A preferred embodiment of the present invention has the advantage of providing improved 
SELECT queries comprising clauses such as EMPTY, MEMBER OF, and EXIST. 
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1 BRIEF DESCRIPTION OF THE DRAWINGS 

2 

3 [0020] For a more complete understanding of the present invention and the advantages thereof, 

4 reference is now made to the Description of the Preferred Embodiment in conjunction with the 

5 attached Drawings, in which: 
6 

7 Figure 1 is a block diagram of a preferred embodiment of the present invention; 

8 

9 Figure 2 illustrates an object oriented hierarchy comprising a model of a plurality of related 

10 enterprise java beans; 
11 

12 Figure 3 illustrates application of a preferred embodiment of the present invention on the 

13 object oriented hierarchy of Figure 2 comprising a model of a plurality of related enterprise java 

14 beans; 
15 

16 Figure 4 illustrates a second application of the preferred embodiment of the present 

17 invention on the object oriented hierarchy of Figure 2 comprising a model of a plurality of related 

1 8 enterprise java beans; 
19 

20 Figure 5, Figure 6, Figure 7, and Figure 8 are flowcharts of method steps preferred in 

21 carrying out a preferred embodiment of the present invention; and 
22 

23 Figure 9 is a block diagram of a computer system used in performing a method of a 

24 preferred embodiment of the present invention, forming part of an apparatus of a preferred 

25 embodiment of the present invention, storing a data structure of a preferred embodiment of the 

26 present invention, and which may use an article of manufacture comprising a computer-readable 

27 storage medium having a computer program embodied in said medium which may cause the 

28 computer system to practice a preferred embodiment of the present invention. 
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1 DESCRIPTION OF THE PREFERRED EMBODIMENT 

2 

3 [0021] An embodiment of the invention is now described with reference to the figures where like 

4 reference numbers indicate identical or functionally similar elements. Also in the figures, the left 

5 most digit of each reference number corresponds to the figure in which the reference number is first 

6 used. While specific configurations and arrangements are discussed, it should be understood that 

7 this is done for illustrative purposes only. A person skilled in the relevant art will recognize that 

8 other configurations and arrangements can be used without departing from the spirit and scope of the 

9 invention. It will be apparent to a person skilled in the relevant art that this invention can also be 
10 employed in a variety of other devices and applications. 

11 

12 [0022] Figure 1 illustrates an e-business application environment 100 which facilitates the 

13 building, running, and management of multi-tiered, web-enabled e-business applications. The 

14 application environment 100 usually comprises three tiers: a first tier comprising a web browser 104; 

15 a second middle tier comprising an application server 108; and a third tier comprising a persistent 

16 store 112. The first tier web browser 104 is provided services through an HTTP server 114 by a 

17 servlet 116 executing on the application server 108. The servlet 116 may comprise one or more 

18 Enterprise JavaBeans (EJB) 118. These EJB's 118 use the persistent store 112 for container- 

19 managed persistence (CMP). A query against the persistent store 112 is mapped from EJBQL to the 

20 native query language of the persistent store 1 12. 
21 

22 [0023] Web browsers 104, pervasive devices, PCs and other tier one devices may access an HTTP 

23 server 114 on the middle tier 108 within the application server environment 100. This access may 

24 invoke J ava Server Pages (JSP) 120 or servlets 116, managed by the application server 108, that 

25 access the tier three persistent store 112 data using Java Database Connectivity (JDBC), SQL for 

26 Java (SQU), or JB Query Language (EJBQL). 
27 

28 [0024] The persistent store 112 stores state information for servlets 116 and EJB session beans 

29 118, and it serves as backend storage for bean-managed and container-managed persistence for EJBs 

30 118. JSPs 120 and servlets 116 may also indirectly access a remote data source 112 using EJBs 118, 

3 1 that are invoked using remote method invocation. In addition to web browsers 104, other types of 
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1 client applications can invoke EJBs 118 directly by remote method invocation. 
2 

3 [0025] Session and entity beans execute business logic on the middle tier 108. Both can customize 

4 their access to data using container-managed persistence. The EJB 2.0 draft specification includes an 

5 EJB Query Language that defines finder and select methods to facilitate the use of CMP by entity 

6 beans. For the entity beans 118 to access data in the persistent store 112, the EJBQL query 124 is 

7 translated into the native language of the persistent store, such as a SQL query 126, by an EJB query 

8 engine 122. 
9 

10 [0026] The translation and compilation of an EJBQL query 124 into a SQL query 126 comprises 

1 1 several steps: parsing the query 128, building a query graph model 130, rewriting the query 231, 

12 optimizing the query 234, and generating an executable access plan 136. 
13 

14 [0027] The query engine parser 128 parses the query 124 to validate the syntax and semantics, and 

15 to create a query graph model (QGM) 130 which is an internal, in-memory database that is used to 

1 6 represent the query throughout the query compilation process. 
17 

1 8 [0028] The query rewrite component 132 of the query engine 122 uses the global semantics 

19 provided in the query graph model 130 to transform the query 124 into a form that can be optimized 

20 more easily. For example, the query rewrite component 132 might move a predicate, altering the 

21 level at which it is applied and potentially improving query performance. 
22 

23 [0029] The optimizer portion 134 of the query engine 122 uses the query graph model 130 as input, 

24 and generates many alternative execution plans 136 for satisfying the user's request. It estimates the 

25 execution cost of each alternative plan 136, using the statistics for tables, indexes, columns and 

26 functions, and chooses the access plan 136 with the smallest estimated execution cost. The output 

27 from this step is an access plan 136. 
28 

29 [0030] In the code generation component 138, the query engine 122 uses the access plan 136 and 

30 the query graph model 130 to create an executable access plan 136, or section, for the query. Code 
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1 generation 138 uses information from the query graph model 130 to avoid repetitive execution of 

2 expressions that only need to be computed once for a query. Information about access plans 136 is 

3 stored in the system catalog tables 138. When the executable access plan 136 is executed, the 

4 database manager 140 uses the information stored in the system catalog tables 138 to determine how 

5 to access the data stored in tables 142 and provide results for the query. 
6 

7 [0031] In order to translate an EJB QL query containing a path expression to a valid SQL query 

8 with improved performance, the preferred embodiment of the present invention alters the operation 

9 of the above parser, query rewrite, optimization, and code generation components as described 
10 below. 

11 

12 [0032] Referring now to Figure 2, an application of a preferred embodiment of the present 

13 invention on an object oriented hierarchy is illustrated. Assume an application having an EJB model 

14 comprising a tree 200 of eight EJBs: BeanA 202, BeanB 204, BeanC 206, BeanD 208, BeanE 210, 

15 BeanF 212, BeanG 214, and BeanH 216. These beans are mapped to database tables A 218, B 220, 

16 C 222, D 224, E 226, F 228, G 220, and H 232, respectively. BeanA 202 has 1 : 1/M: 1 relationship 

17 (rb) 234 with BeanB 204, 1:1/M:1 relationship (rd) 236 with BeanD 208, and 1:1:/M:1 relationship 

1 8 (rf) 238 with BeanF 212. BeanB 204 has 1 : 1/M: 1 relationship (rc) 240 with BeanC 206. BeanD 208 

19 has 1 :M relationship (re) 242 with BeanE 210 and 1 : 1/M: 1 relationship (rh) 244 with BeanH 216. 

20 BeanF 212 has 1 : 1/M: 1 relationship (rg) 246 with BeanG 214. 
21 

22 [0033] The symbol "OJ" represents an outer join, symbol "D" represents an inner join, and the 

23 symbol "," represents a Cartesian product between two tables. An outer join is a relational algebra 

24 operator that performs an extended join operation in which the tuples or rows of one relation or table 

25 that have no counterpart in a second relation or table appear in the resulting relation concatenated 

26 with all null values. A Cartesian product is a relational algebra operator that produces a relation or 

27 table that contains all possible ordered concatenations or joinings of records from two existing tables 

28 that meet certain specified criteria on the data values. An inner join is equivalent to a Cartesian 

29 product followed by a select applied to the resulting table. A lower case first letter of a bean name 

30 represents an identification variable for each Bean. (e.g. "a" is identification variable for BeanA). 
31 
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1 [0034] Assume that the following EJB query 302 in the EJB query language is executed over the 

2 EJB model 200: 

3 EJB Query: 

4 Select a.rb.rc from BeanA a, BeanB b, in(a.rd.re)p 

5 where p.name = 'EJBQuery' and 

6 a.rf.rg is null and 

7 b.name = a.name 
8 

9 [0035] The EJB query 302 contains five path expressions, "a.rb.rc" 304 in the SELECT clause 306, 

10 "a" 308, "b" 310, and "a.rd.re" 312 in the FROM clause 314, and "a.rf.rg" 316 in the WHERE clause 

1 1 318. In accordance with the preferred embodiment of the present invention, each of these path 

12 expressions (304, 308, 310, 312, and 316) comprising navigation paths through relationships in a 

13 schema in the object oriented query 302 are translated to either a relational database outer join or a 

14 relational database inner join by the performance of the following described process. The process 

15 begins by identifying each path expression defined in each level of the object oriented query to 

16 identify path expressions for analysis. Thus, the process identifies the path expressions "a.rb.rc" 304 

17 in the SELECT clause, "a.rd.re" 312 in the FROM clause, and "a.rf.rg" 316 in the WHERE clause to 

18 identify a list of path expressions for analysis 320 comprising "a.rb", "a.rb.rc", "a.rd", "a.rd.re", 

19 "a.rf ', and "a.rf.rg". 
20 

21 [0036] The list of identified path expressions 320 is then analyzed to yield a path analysis of the 

22 FROM. SELECT, and WHERE clause path expressions. The path expression analysis produces a 

23 list 322 of path expressions to be evaluated as candidates for translation to an outer join comprising 

24 "a", "b", "a.rd", "a.rd.re", "a.rf \ "a.rf.rg", "a.rb", and "a.rb.rc". 
25 

26 [0037] From the list 322 produced by the path expression analysis, each path expression is 

27 evaluated to determine if the path expression is a candidate for a translation to an outer join. This 

28 evaluation applies rules, including but not limited to the following rules, to determine if a path 

29 expression is a candidate for a translation to a left outer join: 

30 If the path expression is in a SELECT, GROUP BY, ORDER BY, or HAVING clause; 

31 If the path expression is in a WHERE clause with a NULL or OR operator; 
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1 If the path expression is in a WHERE clause which is NULL; or 

2 If the path expression is in an outer join in a WHERE clause, then outer join candidate. 

3 Applying these rules to the identified path expressions "a", "b", "a.rd", "a.rd.re", "a.rf ', "a.rf.rg", 

4 "a.rb", and "a.rb.rc" of the list 322 of path expressions to be evaluated as candidates for translation to 

5 an outer join produces a list of outer join candidates 324 comprising path expressions "a.rf ', 

6 "a.rf.rg", "a.rb", and "a.rb.rc". 
7 

8 [0038] The path expressions are ordered beginning with the identified path expressions defined in 

9 the FROM clause 314, to which are concatenated the path expressions 324 identified as candidates 

10 for a translation to an outer join from any WHERE clause 318 or any SELECT clause 306, to which 

1 1 are concatenated any remaining path expressions from any WHERE clause 318 or any SELECT 

12 clause 306 which are not identified as candidates for a translation to an outer join. The FROM 

13 clause 314 ordering comprises path expression a 326, path expression b 328, path expression "a.rd" 

14 330, and path expression "a.rd.re" 332. The ordering of the FROM clause path expressions results in 

15 the initial sequence of path expressions "a, b, a.rd, a.rd.re" to which is concatenated the path 

16 expressions from the WHERE clause 318 and SELECT clause 306 identified as a candidate for a 

17 translation to an outer join. The path expressions identified as a candidate for a translation to an 

1 8 outer join comprise path expressions "a.rf ' 334, "a.rf.rg" 336, "a.rb" 338, and "a.rb.rc" 340. The 

19 remaining path expressions from the WHERE clause 318 and SELECT clause 306 which are not 

20 identified as candidates for a translation to an outer join comprise path expressions "b" 342, "a.rd" 

21 344, and "a.rd.re" 346. This produces the following path expression ordering 348: 

22 a, b, a.rd, a.rd.re, a.rf, a.rf.rg, a.rb, a.rb.rc, b. a.rd, a.rd.re 
23 

24 [0039] These ordered path expressions 348 are input to a select operator for each level of an object 

25 oriented query, and the ordered path expressions 348 are grouped sequentially based upon the 

26 identifications as a candidate for a translation to an outer join (outer join candidates preceding inner 

27 join candidates), and based upon a source-target dependency between the ordered path expressions 

28 348. In this source-target dependency ordering, a source of a relationship precedes a target of a 

29 relationship, i.e., a 350 representing the source BeanA 202 of the hierarchy preceding all other path 

30 expressions, and a.rf 352 preceding a.rf.rg 354 due to BeanG 232 being the target and BeanF 212 
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1 being the source in relationship rg 246 and due to BeanF 212 being the target and BeanA 202 being 

2 the source in relationship rf 238 . Duplicate path expressions are also eliminated. In this example, 

3 duplicates b 328, a.rd 330, and a.rd.re 332 are eliminated from the path expressions resulting from 

4 the FROM clause as b 342, a.rd 344, and a.rd.re 346 already appear in the path expression ordering 

5 348 in path expressions resulting from the outer join candidate portions of the path expression 

6 ordering 348. This produces the following path expression grouping 356: 

7 a, a.rf, a.rf.rg, a.rb, a.rb.rc, b, a.rd, a.rd.re 
8 

9 [0040] A quantifier is then created for each path expression in the grouping wherein each 

10 quantifier comprises a variable representing a table in a relational database. A quantifier ql 358 is 

1 1 created for a table A; q2 360 for table F; q3 362 for table G; q4 364 for table B; q5 366 for table C; 

12 q6 368 for table B; q7 370 for table D; and q8 372 for table E. In the quantifier grouping 358 

1 3 through 372, each grouped quantifier is replaced with the quantifier and its corresponding related 

14 table in a relational database to produce the following table-quantifier sequence 374: 

15 Aql LOJ Fq2 LOJ Gq3 LOJ Bq4 LOJ Cq5 LOJ Bq6 LOJ Dq7 LOJ Eq8 
16 

17 [0041] An optimization may then be performed upon the table-quantifier sequence 374 to 

1 8 determine quantifiers which are a candidate for an inner join. The use of an outer join decreases 

19 performance of an EJB query in the database. The preferred embodiment of the present invention 

20 analyzes the EJB query, and identifies situations in which an outer join can be converted to an inner 

21 join in order to improve the performance of the EJB query. The preferred embodiment analyzes 

22 portions of a path expression and applies the following rules to the analyzed portions of a path 

23 expression to determine which outer joins to convert to inner joins: 

24 If a corresponding path expression is used in a FROM clause, then build as an inner 

25 join. 

26 If a LIKE, IN, or BETWEEN operator exists in a WHERE clause containing a 
2 ? corresponding path expression, then build as an inner join. 

2 » If an EQUAL, LESS THAN, GREATER THAN, LESS THAN OR EQUAL, 

29 GREATER THAN OR EQUAL, NOT EQUAL, or NOT NULL operator exits in a WHERE 

30 clause, then build as an inner join. 

31 In EJB query 302, "b", "a.rd", and "d.re" are path expressions used in FROM clause. Thus, "b", 
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1 "a.rd", and "d.re" along with their corresponding tables and quantifiers are candidates for an inner 

2 join. The inner join optimization would also move the inner join candidates to the end of the table- 

3 quantifier sequence; however, that is not necessary in this example as the inner join candidates are 

4 already at the end of the table-quantifier sequence. The inner join optimization yields the following 

5 optimized table-quantifier sequence 376: 

6 Aql LOJ Fq2 LOJ Gq3 LOJ B q4 LOJ Cq5 , B q6 , Dq7 , Eq8 

7 Alternatively, table-quantifier sequence 376 may be directly generated by the above replacing step in 

8 lieu of table quantifier 374 if during the replacing step, path expressions used in a WHERE clause 

9 are replaced by an inner join of the table and quantifier, and moved to the end of the table-quantifier 
10 sequence. 

11 

12 [0042] A translation of the object oriented query to a relational query based upon this table- 

13 quantifier sequence 376 is then completed to produce the following relational query 378 in which 

14 q3.* means return all columns for table C and in which table G is assumed to have a primary key of 

15 "pkey": 

16 Select q5.* from A ql LOJ F q2 LOJ G q3 LOJ B q4 LOJ C q5 , B q6 , D q7 , E q8 

17 where (q8."name" = 'EJBQuery") and 

18 (q3."pkey" is NULL) and 

19 (q8.fk = q7.pk)and 

20 (q6."name" = ql."name") and 

21 (Q7.pk = ql.fk) 
22 

23 [0043] The following second example illustrates a translation of an EJB query 402 with a WHERE 

24 clause containing an EQUAL operator and a NULL operator which is executed over the EJB model 

25 200: 

26 EJB Query 

27 Select a.rd.re fromBeanAa 

28 where a.rd.re.name = "EJBQuery" and 

29 a.rd.rh.name is null. 

30 The preferred embodiment of the present invention translates this EJB query into the following SQL 

31 query: 
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1 SQL Query 

2 Select q3.* from A ql LOJ Dq2 LOJ Hq4 , Eq3 

3 where (q3. M name n = "EJBQuery") and 

4 (q4."name" is NULL) 
5 

6 [0044] More specifically, the EJB query 402 contains four path expressions, "a.rd.re" 404 in the 

7 SELECT clause 406, "a" 408 in the FROM clause 410, and "a.rd.re" 412 and "a.rd.rh" 414 in the 

8 WHERE clause 416. In accordance with the preferred embodiment of the present invention, each of 

9 these path expressions (404, 408, 412, and 414) comprising navigation paths through relationships in 

10 a schema in the object oriented query 402 are translated to either a relational database outer join or a 

1 1 relational database inner join by the performance of the following described process. The process 

12 begins by identifying each path expression defined in each level of the object oriented query to 

13 identify path expressions for analysis. The process identifies the path expressions "a.rd.re" 404 in 

14 the SELECT clause 406, "a.rd.re" 412 in the WHERE clause 416, and "a.rd.rh" 414 in the WHERE 

15 clause 416 resulting in the list of identified path expressions 418. 
16 

17 [0045] The list of identified path expressions 418 is then analyzed to yield a path analysis of the 

18 FROM. SELECT, and WHERE clause path expressions. The path expression analysis produces a 

19 list 420 of path expressions to be evaluated as candidates for translation to an outer join comprising 

20 "a", "a.rd", "a.rd.re", and "a.rd.rh". 
21 

22 [0046] From the list produced by the path expression analysis 420, each of path expression is 

23 evaluated to determine if the identified path expression is a candidate for a translation to an outer 

24 join. This analysis applies rules, including but not limited to the following rules, to determine if the 

25 identified path expression is a candidate for a translation to an outer join: 

26 If the path expression is in a SELECT, GROUP BY, ORDER BY, or HAVING clause; 

27 If the path expression is in a WHERE clause with a NULL or OR operator; 

28 If the path expression is in a WHERE clause which is NULL; or 

29 If the path expression is in an outer join in a WHERE clause, then outer join candidate. 

30 Applying these rules to the identified path expressions "a.rd.re" 404, "a.rd.re" 412, and "a.rd.rh" 414 

31 of the list 420 of path expressions to be evaluated as candidates for translation to an outer join 
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1 produces a list of outer join candidates 422 comprising the path expressions "a.rd", "a.rd.re", and 

2 "a.rd.rh". 
3 

4 [0047] The path expressions are ordered beginning with the path expressions defined in the FROM 

5 clause 410, to which are concatenated the path expressions 422 identified as candidates for a 

6 translation to an outer join from any WHERE clause 416 or any SELECT clause 406, to which are 

7 concatenated any remaining path expressions from any WHERE clause 416 or any SELECT clause 

8 406 which are not identified as candidates for a translation to an outer join. The FROM clause 410 

9 ordering comprises a 408 resulting in an initial sequence of path expressions "a" 424 to which is 

10 concatenated the path expressions from the WHERE clause 416 and SELECT clause 406 identified 

11 as a candidate for a translation to an outer join. The path expressions identified as a candidate for a 

12 translation to an outer join comprise path expressions "a.rd" 426, "a.rd.re" 428, and "a.rd.rh" 430. 

13 The remaining path expression from the WHERE clause 416 and SELECT clause 406 which is not 

14 identified as a candidate for a translation to an outer join is path expressions "a.rd.re" 432. This 

15 produces the following path expression ordering 434: 

16 a, a.rd, a.rd.re, a.rd.rh, a.rd.re 
17 

18 [0048] These ordered path expressions 434 are input to a select operator for each level of an object 

19 oriented query, and the ordered path expressions 434 are grouped sequentially based upon the 

20 identifications as a candidate for a translation to an outer join (outer join candidates preceding inner 

21 join candidates), and based upon a source-target dependency between the ordered path expressions 

22 434. In this source-target dependency ordering, a source of a relationship precedes a target of a 

23 relationship, i.e., a 436 representing the source BeanA 202 of the hierarchy preceding all other path 

24 expressions, and a.rd 438 preceding a.rd.re 440 due to BeanD 208 being the target and BeanA 202 

25 being the source in relationship rd 236 and due to BeanE 210 being the target and BeanD 208 being 

26 the source in relationship re 242 . The source-target dependency ordering also causes a.rd.re 440 to 

27 precede "a.rd.rh" 442. Duplicate path expressions are also eliminated. In this example, duplicate 

28 "a.rd.re" 432 is eliminated from the path expressions resulting from the SELECT clause 406 as 

29 "a.rd.re" 428 already appears in the path expression ordering 434 in path expressions resulting from 

30 the outer join candidate portions of the path expression ordering 434. This produces the following 
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1 path expression grouping 444: 

2 a, a.rd, a.rd.re, a.rd.rh 
3 

4 [0049] A quantifier is then created for each path expression in the grouping wherein each quantifier 

5 comprising a variable representing a table in a relational database. A quantifier ql 446 is created for 

6 a table A; q2 448 for table D; q3 450 for table E; and q4 452 for table H. In the quantifier grouping 

7 446 through 452, each grouped quantifier is replaced with the quantifier and its corresponding 

8 related table in a relational database to produce the following table-quantifier sequence 454: 

9 Aql LOJ Dq2 LOJ Eq3 LOJ Hq4 
10 

1 1 [0050] The optimization upon the table-quantifier sequence 454 is performed to determine 

1 2 quantifiers which are a candidate for an inner join. In EJB query 402, "a.rd.re" is a path expression 

13 used in a WHERE clause containing an EQUAL operator, and "a.rd.re" along with its corresponding 

14 table and quantifier are candidates for an inner join which are moved to the end of the table- 

15 quantifier sequence yielding the following optimized table-quantifier sequence 456: 

16 A ql LOJ D q2 LOJ Hq4 , Eq3 
17 

18 [0051] A translation of the object oriented query to a relational query based upon this table- 

19 quantifier sequence 456 is then completed to produce the following relational query 458 in which 

20 q3.* means return all columns for table E: 

21 Select q3.* from Aql LOJ Dq2 LOJ Hq4 , Eq3 

22 where (q3. "name" = 'EJBQuery") and 

23 (q4."name" is NULL) 
24 

25 [0052] Referring now to Figures 5, 6, 7, and 8, the flowcharts 500, 600, 700, and 800 illustrate the 

26 operations preferred in carrying out the preferred embodiment of the present invention. In the 

27 flowcharts, the graphical conventions of a diamond for a test or decision and a rectangle for a process 

28 or function are used. These conventions are well understood by those skilled in the art, and the 

29 flowcharts are sufficient to enable one of ordinary skill to write code in any suitable computer 

30 programming language. 
31 
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1 [0053] After the start 505 of the process 500, process block 510 analyzes each path expression 

2 defined in each level of the object oriented query, and process block 515 identifies each path 

3 expression which can be a candidate for a translation to an outer join. Process block 520 then orders 

4 the path expressions starting with path expressions defined in a FROM clause, adding to the FROM 

5 clause path expressions, each path expression identified as a candidate for a translation to an outer 

6 join, and making the ordered path expressions as input to a select operator for each level of the object 

7 oriented query. Process block 525 groups the ordered path expressions sequentially based upon on a 

8 source-target dependency between ordered path expressions and based upon the identifications as a 

9 candidate for a translation to an outer join. Process block 530 creates a quantifier for each path 

10 expression, said quantifier comprising a variable representing a table in a relational database. 

1 1 Thereafter, process block 535 replaces each grouped quantifier with a related table in a relational 

12 database. Process block 540 then completes a translation of the object oriented query to a relational 

13 query. The process ends at process block 535. 
14 

15 [0054] Referring now to Figure 6, the flowchart 600 illustrates the optimization which may be 

16 performed after process block 535 and before process block 540 in an alternative embodiment of the 

17 present invention. After process block 535 replaces each grouped quantifier with a related table in a 

18 relational database, process block 605 performs optimization on the grouped quantifiers, said 

19 optimization identifying quantifiers which can be a candidate for a translation to an inner join. 

20 Thereafter, process block 610 moves inner join candidates to the end of the table-quantifier 

21 sequence, and process block 615 generates an inner join for each quantifier which remains after 

22 optimization a candidate for a translation to an inner join. Process block 620 generates an outer join 

23 for each quantifier which remains after optimization a candidate for a translation to an outer join. 

24 Processing then continues to process block 540 which completes the translation of the object 

25 oriented query to a relational query. 
26 

27 [0055] Figure 7 and Figure 8 illustrate an expansion of process block 605 which performs the 

28 optimization on the grouped quantifiers to identify the quantifiers which can be a candidate for a 

29 translation to an inner join. Decision block 705 through decision block 730 implement rules which 

30 process each quantifier and identify a quantifier as a candidate for a translation to an inner join, and 
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1 decision block 810 through decision block 825 implement rules which process each quantifier and 

2 identify a quantifier as a candidate for a translation to an outer join. 
3 

4 [0056] Referring now to Figure 7, flowchart 700 illustrates the rules which identify a quantifier as 

5 a candidate for a translation to an inner join. Decision block 710 determines if a corresponding path 

6 expression is used in a FROM clause. If not, then decision block 715 determines if a LIKE, IN, or 

7 BETWEEN operator exists in a WHERE clause containing a corresponding path expression. If not, 

8 then decision block 725 determines if a WHERE clause contains an EQUAL, LESS THAN, 

9 GREATER THAN, LESS THAN OR EQUAL, GREATER THAN OR EQUAL, NOT EQUAL. If 

10 not, then decision block 730 determines if a NOT NULL operator exits in a WHERE clause. If not, 

1 1 then control passes to decision block 810 on Figure 8, illustrated by flowchart connectors A, 740 on 

12 Figure 7 and 805 on Figure 8. 
13 

14 [0057] Returning now to decision block 710 through decision block 730, if any of these decision 

15 blocks determine that the tested condition is true, then control passes to process block 735 which 

16 identifies the quantifier as a candidate for a translation to an inner join. Control then returns to 

17 decision block 710 to process the next quantifier, illustrated by flowchart connectors B, 745 and 750 

18 on Figure 7. 
19 

20 [0058] Referring now to Figure 8, flowchart 800 illustrates the rules which identify a quantifier as 

21 a candidate for a translation to an outer join. Decision block 810 determines if the path expression is 

22 in a SELECT, GROUP BY, ORDER BY, or HAVING clause. If not, then decision block 815 

23 determines if the path expression is in a WHERE clause with a NULL or OR operator. If not, then 

24 decision block 820 determines if the path expression is in a WHERE clause which is NULL. If not, 

25 then decision block 825 determines if an outer join is in a WHERE clause. If not, then control 

26 returns to decision block 705 on Figure 7 to process the next quantifier. This is illustrated by 

27 flowchart connectors B, 835 on Figure 8 and 750 on Figure 7. 
28 

29 [0059] Returning now to decision block 810 through decision block 825, if any of these decision 

30 blocks determine that the tested condition is true, then control passes to process block 830 which 
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1 identifies the quantifier as a candidate for a translation to an outer join. Thereafter, control returns to 

2 decision block 705 on Figure 7 to process the next quantifier. This is illustrated by flowchart 

3 connectors B, 835 on Figure 8 and 750 on Figure 7. 
4 

5 [0060] With reference now to the figures, and in particular with reference to Figure 9, there is 

6 depicted a pictorial representation of a computer system 900 which may be utilized to implement a 

7 method, system, article of manufacture, data structure, and computer program product of preferred 

8 embodiments of the present invention. The block diagram of Figure 9 illustrates a computer system 

9 900 used in performing the method of the present invention, forming part of the apparatus of the 

10 present invention, and which may use the article of manufacture comprising a computer-readable 

1 1 storage medium having a computer program embodied in said medium which may cause the 

12 computer system to practice the present invention. The computer system 900 includes a processor 

13 902, which includes a central processing unit (CPU) 904, and a memory 906. Additional memory, in 

14 the form of a hard disk file storage 908 and a computer-readable storage device 910, is connected to 

15 the processor 902. Computer-readable storage device 910 receives a computer-readable storage 

16 medium 912 having a computer program embodied in said medium which may cause the computer 

17 system to implement the present invention in the computer system 900. The computer system 900 

1 8 includes user interface hardware, including a mouse 914 and a keyboard 916 for allowing user input 

19 to the processor 902 and a display 918 for presenting visual data to the user. The computer system 

20 may also include a printer 920. 
21 

22 [0061] Using the foregoing specification, the invention may be implemented using standard 

23 programming and/or engineering techniques using computer programming software, firmware, 

24 hardware or any combination or sub-combination thereof. Any such resulting program(s), having 

25 computer readable program code means, may be embodied within one or more computer usable 

26 media such as fixed (hard) drives, disk, diskettes, optical disks, magnetic tape, semiconductor 

27 memories such as Read-Only Memory (ROM), Programmable Read-Only Memory (PROM), etc., or 

28 any memory or transmitting device, thereby making a computer program product, i.e., an article of 

29 manufacture, according to the invention. The article of manufacture containing the computer 

30 programming code may be made and/or used by executing the code directly or indirectly from one 

3 1 medium, by copying the code from one medium to another medium, or by transmitting the code over 
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1 a network. An apparatus for making, using, or selling the invention may be one or more processing 

2 systems including, but not limited to, central processing unit (CPU), memory, storage devices, 

3 communication links, communication devices, servers, input/output (I/O) devices, or any sub- 

4 components or individual parts of one or more processing systems, including software, firmware, 

5 hardware or any combination or sub-combination thereof, which embody the invention as set forth in 

6 the claims. User input may be received from the keyboard, mouse, pen, voice, touch screen, or any 

7 other means by which a human can input data to a computer, including through other programs such 

8 as application programs, databases, data sets, or files. 
9 

10 [0062] One skilled in the art of computer science will easily be able to combine the software 

1 1 created as described with appropriate general purpose or special purpose computer hardware to 

12 create a computer system and/or computer sub-components embodying the invention and to create a 

13 computer system and/or computer sub-components for carrying out the method of the invention. 

14 Although the present invention has been particularly shown and described with reference to a 

15 preferred embodiment, it should be apparent that modifications and adaptations to that embodiment 

16 may occur to one skilled in the art without departing from the spirit or scope of the present invention 

17 as set forth in the following claims. 
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